An Evaluation of Deeply Decoupled Cores
نویسندگان
چکیده
The trend towards larger structures and aggressive clock frequencies has been a fundamental driving force for modern microprocessor design. While one approach is to deeply pipeline any high delay structure, dependencies and critical loops have made it increasingly difficult to speed execution through extensive pipelining. One alternative is to remove larger structures from the critical path. We explore the ramifications of stripping all but the most necessary functionality out of the processing core, leaving only a tiny -core. Although past studies have shown the possibility to build decoupled structures for some individual helper structures, the impact of streamlining all of these structures at the same time has not been explored. Along with describing the challenges in decoupling the helper engines, we focus on the performance, power consumption and thermal behavior of the -core architecture. We use a detailed performance, power and thermal modeling in our analysis. Our results indicate that the -core provides a 20% reduction in power over a conventional monolithic core, while providing comparable performance (1% improvement on average). By dynamically configuring the helper engines to different application phases, an additional 13% power savings can be attained with only an average 3% degradation in performance. Our experiKURSUN, SHAYESTEH, SAIR, SHERWOOD & REINMAN mental analysis also show that the microcore architecture has favorable thermal behavior, with 86% fewer thermally-critical cycles compared to a monolithic core.
منابع مشابه
Modified-Decoupled Net Present Value: The Intersection of Valuation and Time scaling of Risk in Energy Sector
Although the practical importance of investment analysis in long-term energy investments is well understood, choosing the proper method has always been a dilemma. In this regard, classic evaluation methods, with a history of almost a century, are mostly favored, but using them in the valuation of long-lasting energy projects has particular shortcomings, nevertheless. The drawbacks mainly stem f...
متن کاملArea-Efficient Evaluation of a Class of Arithmetic Expressions Using Deeply Pipelined Floating-Point Cores
Due to technological advances, it has become possible to implement floating-point cores on FPGAs in an effort to provide hardware acceleration for the myriad applications that require high performance floating-point arithmetic. However, in order to achieve a high clock rate, these floating-point cores must be deeply pipelined. Due to this deep pipelining and the complexity of floating-point ari...
متن کاملArea-Efficient Evaluation of Arithmetic Expressions Using Deeply Pipelined Floating-Point Cores
Due to technological advances, it has become possible to implement floating-point cores on FPGAs in an effort to provide hardware acceleration for the myriad applications that require high performance floating-point arithmetic. However, in order to achieve a high clock rate, these floating-point cores must be deeply pipelined. Due to this deep pipelining and the complexity of floating-point ari...
متن کاملLow-Overhead Core Swapping for Thermal Management
Technology scaling trends and the limitations of packaging and cooling have intensified the need for thermally efficient architectures and architecture-level temperature management techniques. To combat these trends, we evaluate the thermal efficiency of the microcore architecture, a deeply decoupled processor core with larger structures factored out as helper engines. We further investigate ac...
متن کاملPareto Optimal Design Of Decoupled Sliding Mode Control Based On A New Multi-Objective Particle Swarm Optimization Algorithm
One of the most important applications of multi-objective optimization is adjusting parameters ofpractical engineering problems in order to produce a more desirable outcome. In this paper, the decoupled sliding mode control technique (DSMC) is employed to stabilize an inverted pendulum which is a classic example of inherently unstable systems. Furthermore, a new Multi-Objective Particle Swarm O...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Instruction-Level Parallelism
دوره 8 شماره
صفحات -
تاریخ انتشار 2006